Domain Specific Ontology Extractor For Indian Languages
Abstract
We present a k-partite graph learning algorithm for ontology extraction from unstructured text. The algorithm divides the initial set of terms into partitions based on the information content of the terms, and then constructs the ontology by detecting subsumption relations between terms in different partitions. This approach not only reduces the amount of computation required for ontology construction but also provides an additional level of term filtering. Experiments are conducted for Hindi and English, and performance is evaluated by comparing the resulting ontology with a manually constructed ontology for the health domain. We observe that our approach significantly improves precision. The proposed approach does not require sophisticated NLP tools such as named-entity recognizers (NER) or parsers, and can be easily adopted for any language.
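The abstract does not say how information content is measured or how subsumption is detected, so the following Python sketch fills both gaps with loudly labeled assumptions: information content is taken as the negative log of a term's document frequency, and subsumption is approximated by a co-occurrence test (a parent term must appear in most documents that contain the child). All function names, the `threshold` parameter, and the toy data are hypothetical. The sketch only illustrates the k-partite idea of comparing terms across partitions and never within one.

```python
import math


def information_content(term, docs):
    """ASSUMPTION: IC = -log(document frequency / number of documents)."""
    df = sum(1 for d in docs if term in d)
    if df == 0:
        return math.inf  # unseen terms are treated as maximally specific
    return -math.log(df / len(docs))


def partition_terms(terms, docs, k):
    """Split terms into at most k partitions, from general (low IC) to specific."""
    ranked = sorted(terms, key=lambda t: (information_content(t, docs), t))
    size = math.ceil(len(ranked) / k)
    return [ranked[i:i + size] for i in range(0, len(ranked), size)]


def subsumes(parent, child, docs, threshold=0.8):
    """ASSUMPTION: parent subsumes child if it occurs in most of child's documents."""
    child_docs = [d for d in docs if child in d]
    if not child_docs:
        return False
    both = sum(1 for d in child_docs if parent in d)
    return both / len(child_docs) >= threshold


def build_ontology(terms, docs, k=3, threshold=0.8):
    """Return (parent, child) edges; terms in the same partition are never compared."""
    parts = partition_terms(terms, docs, k)
    edges = []
    for i, general in enumerate(parts):
        for specific in parts[i + 1:]:
            for p in general:
                for c in specific:
                    if subsumes(p, c, docs, threshold):
                        edges.append((p, c))
    return edges


# Toy corpus: each document is reduced to the set of candidate terms it contains.
docs = [
    {"disease", "fever", "malaria"},
    {"disease", "fever", "dengue"},
    {"disease", "diabetes"},
    {"disease", "fever"},
]
terms = ["disease", "fever", "malaria", "dengue", "diabetes"]

for parent, child in build_ontology(terms, docs, k=3):
    print(parent, "->", child)
```

Because candidate pairs are drawn only from different partitions, the number of subsumption tests is smaller than an all-pairs comparison, which matches the computational saving the abstract claims. The trade-off in this sketch is that two terms landing in the same partition (here, "disease" and "fever") can never be linked.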
Similar resources
Public Transport Ontology for Passenger Information Retrieval
Passenger information aims at improving the user-friendliness of public transport systems while influencing passenger route choices to satisfy transit users' travel requirements. The integration of transit information from multiple agencies is a major challenge in the implementation of multi-modal passenger information systems. The problem of information sharing is further compounded by the multi-l...
Improving Classification of Multi-Lingual Web Documents using Domain Ontologies
In this paper, we deal with the problem of analyzing and classifying web documents into several major categories/classes in a given domain using a domain ontology. We present an ontology-based web content mining methodology whose main stages include collecting a training set of labeled documents from a given domain, building a classification model for this domain given the domain ontolog...
Using the Ontology Paradigm to Integrate Information Systems Oveia: Expanding the Topic Maps frontier
Ontology-based websites are one possible implementation of the Semantic Web. There are several languages for ontology specification: RDF, OWL, Topic Maps. Topic Maps follow a formally specified structure, which makes them a good choice for semantic website specification. The process of ontology development based on Topic Maps is complex and time consuming, and it requires a lot of human and financia...
Improving the Search for Learning Objects with Keywords and Ontologies
We report on an ongoing project which aims at improving the effectiveness of retrieval and the accessibility of learning objects within learning management systems and learning object repositories. The project Language Technology for eLearning approaches this task by providing Language Technology based functionalities and by integrating semantic knowledge through domain-specific ontologies. We will re...
Classification of Web Documents Using Concept Extraction from Ontologies
In this paper, we deal with the problem of analyzing and classifying web documents in a given domain by information filtering agents. We present an ontology-based web content mining methodology whose main stages include creating an ontology for the specified domain, collecting a training set of labeled documents, building a classification model in this domain using the constructed onto...
Publication date: 2012